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ABSTRACT 

We  propose  a  channel-matched  hierarchical  table-lookup 
vector  quantizer  (CM-HTVQ)  which  provides  some  ro¬ 
bustness  against  the  channel  noise.  We  use  a  finite-state 
channel  to  model  slow  fading  channels  and  propose  an 
adaptive  coding  scheme  to  transmit  a  source  over  wire¬ 
less  channels.  The  performance  of  CM-HTVQ  is  in  gen¬ 
eral  slightly  inferior  to  that  of  channel- optimized  vector 
quantizer  (COVQ)  (the  performances  coincide  at  some 
cases);  however,  the  encoder  complexity  of  CM-HTVQ 
is  much  less  than  the  encoder  complexity  of  COVQ. 

l.INTRODUCTION 

Vector  quantization  is  a  powerful  tool  for  source  coding 
which  has  been  used  in  many  speech  and  image  coding 
systems  [1].  The  encoder  of  a  vector  quantizer  (VQ) 
is  usually  implemented  by  computing  the  distortion  be¬ 
tween  the  input  vector  and  each  codevector  in  the  code¬ 
book  and  finding  the  codevector  which  results  in  mini¬ 
mum  distortion.  The  decoder  however  is  a  simple  table 
lookup.  In  [2],  Chang  et  al.  have  proposed  a  hierarchi¬ 
cal  table-lookup  vector  quantizer  (HTVQ)  in  which  the 
encoder  is  implemented  using  a  table-lookup  in  multi¬ 
ple  stages.  For  every  possible  input  vector,  the  table- 
lookups  store  the  codeword  of  the  nearest  codevector. 
Then,  in  the  process  of  encoding,  input  vectors  to  the 
encoder  are  used  directly  as  addresses  for  the  tables  to 
select  the  best  codeword.  The  entire  idea  is  predicated 
on  the  assumption  that  there  are  a  finite  number  of  in¬ 
put  vectors,  i.e.,  each  input  sample  is  already  quantized 
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-  a  reasonable  assumption  in  most  practical  situations. 
Assuming  n-bit  input  samples  and  a  A:-dimensional  VQ, 
the  size  of  such  a  table  is  2”^  times  the  number  of  output 
bits.  To  make  sure  that  the  table  sizes  do  not  exceed 
manageable  limits  for  large  dimension  VQ’s,  [2]  adopts 
a  hierarchical  structure  as  shown  in  Fig.  1.  The  input 
address  of  Table  *  -|-  1  in  Fig.  1  is  constructed  by  com¬ 
bining  outputs  of  Table  i.  The  most  practical  con¬ 
figuration  \ski  =2  V*  which  is  assumed  throughout  this 
paper.  When  applied  to  speech  coding,  the  system  in  [2] 
is  shown  to  suffer  only  a  1  dB  performance  degradation 
compared  with  a  full-search  VQ  while  the  encoding  com¬ 
plexity  is  dramatically  reduced.  Vishwanath  and  Chou 
have  used  the  same  system  for  video  coding  [3].  They 
also  combine  HTVQ  with  a  discrete  wavelet  transform 
to  improve  the  performance  of  the  HTVQ.  They  demon¬ 
strate  the  desirable  characteristics  of  their  system  for 
“interactive  multicast  over  multiple  rate  channels.” 

In  this  paper,  we  study  the  performance  of  the  scheme 
proposed  by  Vishwanath  and  Chou  over  noisy  channels 
and  propose  a  channel-matched  HTVQ  (CM-HTVQ) 
which  provides  the  simplicity  of  an  HTVQ,  i.e.,  table- 
lookup  encoding  and  decoding,  and  robustness  against 
channel  noise.  We  present  results  for  memoryless  Gaus¬ 
sian  sources,  still  images,  and  video  over  memory  less  bi¬ 
nary  symmetric  channels  (BSC’s)  and  wireless  channels 
with  slow  lognormal  fading.  Section  2  provides  the  de¬ 
sign  procedure  of  a  CM-HTVQ  for  BSC’s  and  presents 
numerical  results.  In  Section  3,  the  results  of  Section  2 
are  extended  to  a  finite-state  channel  (FSC).  Section  4 
provides  concluding  remarks. 

2.CM-HTVQ  FOR  BSC’S 

The  basic  idea  behind  channel-optimized  VQ  (COVQ) 
is  to  design  the  VQ  encoder  and  decoder  such  that  the 
end-to-end  average  distortion  after  encoding,  transmis¬ 
sion  over  the  channel,  and  decoding  is  minimized  [4]. 
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The  optimization  is  done  for  a  given  source,  a  given 
noisy  channel,  a  fixed  dimension  k,  and  a  fixed  codebook 
size  M.  The  design  algorithm  is  a  modified  generalized 
Lloyd  algorithm,  with  the  following  expressions  for  the 
optimal  partition,  consisting  of  encoding  cells,  S'j’s,  and 
optimal  codevectors,  Cj’s  (squared-error  distortion  mea¬ 
sure  is  assumed)  [4]: 

Y.f=tP{m  l|x-c,  P,  Ml}, 

and 

^  E»=0  ^  ^(jp)  4  xp(x)dx 

where  P{j\i)  denotes  the  probability  that  j  is  received 
given  that  i  is  transmitted,  p(x)  is  the  A:-fold  probability 
density  function  of  the  source,  and  *,  j  G  {0, 1,  •  •  • ,  M  — 
1}.  It  is  a  well  known  fact  that  the  performance  of 
COVQ  is  better  than  that  of  VQ  when  the  channel  is 
noisy. 

Like  a  regular  VQ,  a  COVQ  suffers  from  the  high 
encoding  complexity  problem.  To  use  the  hierarchical 
structure  of  an  HTVQ  and  design  a  COVQ  which  uses 
lookup  tables,  first  we  consider  the  case  of  an  V-stage 
HTVQ  and  a  memoryless  BSC  with  a  crossover  proba¬ 
bility  of  e  (the  transition  probability  P{j\i)  can  be  com¬ 
puted  when  e  is  known). 

Since  the  purpose  of  the  first  V— 1  stages  is  to  provide 
the  best  pair  of  addresses  for  the  stage,  the  design 
procedure  for  these  stages  remains  unchanged.  The  last 
stage  however  must  be  adapted  to  the  characteristics  of 
the  channel.  So,  we  use  a  COVQ  to  design  the  lookup 
table  of  the  last  stage.  We  call  this  structure  CM-HTVQ 
because  the  last  stage  of  the  HTVQ  is  matched  to  the 
characteristics  of  the  channel.  Note  that  CM-HTVQ  is 
not  optimal  since  the  design  of  the  intermediate  stages, 
even  for  noiseless  channels,  is  not  optimal  [2]. 

Tables  1  and  2  show  the  performance  of  HTVQ, 
VQ,  CM-HTVQ,  and  COVQ  for  a  memoryless  Gaus¬ 
sian  source  at  rates  0.5  and  1  bit/sample  (compression 
ratio  (CR)  16:1  and  8:1),  respectively.  We  have  used 
simulated  annealing  to  assign  indices  [5]  to  the  codevec¬ 
tors  of  VQ  and  HTVQ.  The  source  consists  of  samples 
from  a  memoryless  Gaussian  source  quantized  using  an 
8-bit  Lloyd-Max  scalar  quantizer.  As  it  is  clear  from 
the  tables,  the  difference  between  the  performance  of 
COVQ  and  CM-HTVQ  is  less  than  the  gap  between 
VQ  and  HTVQ  for  a  noiseless  channel  (e  =  0.0).  The 
gap  shrinks  as  the  channel  becomes  noisier.  The  same 
observation  can  be  made  based  on  the  simulation  re¬ 
sults  for  the  512  x  512  Lenna  (Tables  3  and  4).  In  this 


case,  our  quantizers  are  designed  using  five  USC  images 
(Couple,  Crowd,  Man,  Womanl,  and  Woman2)  as  the 
training  sequence.  The  simulation  results  are  obtained 
by  computing  the  average  results  of  10  experiments. 

Similar  results  are  obtained  for  the  360  x  288  “Sales¬ 
man”  video  sequence.  Table  5.  To  quantize  a  video  se¬ 
quence,  we  use  two  quantizers.  One  of  them  quantizes 
frame  differences  and  the  other  one  compresses  the  orig¬ 
inal  frames  every  5  frames.  To  quantize  the  original 
frames,  we  use  the  quantizer  designed  for  still  images  at 
a  bit  rate  r  =  1  bit  per  pixel  (bpp).  The  other  quan¬ 
tizer  is  designed  using  ten  frame  differences  obtained 
from  the  “Miss  America”  video  sequence  and  operates 
at  a  bit  rate  r  =  0.5  bpp. 

It  is  also  clear  from  the  tables  that  CM-HTVQ 
demonstrates  some  robustness  against  the  channel  noise 
(compared  with  HTVQ  and  VQ).  Thus,  for  the  case  of 
a  memoryless  BSC,  not  only  does  CM-HTVQ  provide 
a  simple  table-lookup  encoding,  but  also  it  achieves  al¬ 
most  the  same  performance  as  that  of  COVQ. 

3.CM-HTVQ  FOR  FADING  CHANNELS 

To  tackle  the  same  problem  for  a  wireless  channel,  we 
model  the  channel  by  an  FSC  with  a  BSC  associated 
with  each  state  and  a  Markov  chain  governing  the  tran¬ 
sition  between  the  states  [6].  A  simple  way  to  model 
a  slowly-varying  fading  channel  by  an  FSC  is  to  divide 
the  range  of  the  received  signal-to-noise  ratios  (SNR’s) 
into  a  number  of  equiprobable  intervals,  representing 
the  states,  and  model  each  state  by  a  BSC.  The  de¬ 
tails  for  a  lognormal  fading  channel  employing  uncoded 
binary  phase-shift  keying  as  the  signaling  system  can 
be  found  in  [7].  In  this  work,  we  assume  that  both  the 
transmitter  and  receiver  have  access  to  the  channel  state 
information  (CSI).  Repeating  the  above  design  proce¬ 
dure  for  each  of  the  constituent  channels  of  the  FSC 
and  using  the  appropriate  table  based  on  the  available 
CSI  lead  to  a  CM-HTVQ  system  which  is  matched  to 
the  fading  channel  (Fig.  2). 

The  main  advantage  of  HTVQ,  other  than  table- 
lookup  encoding,  is  the  simplicity  of  transcoding  from 
a  higher  bit  rate  to  a  lower  bit  rate  code.  This  can 
be  done  by  simply  adding  new  stages  to  the  HTVQ, 
basically  amounting  to  adding  memory  to  the  hard¬ 
ware.  One  important  application  in  which  this  table- 
lookup  transcoding  might  prove  useful  is  multicasting. 
Let  us  think  of  the  connection  between  the  consecutive 
stages  of  HTVQ  as  a  channel.  These  channels  might  be 
noiseless  channels,  memoryless  BSC’s,  or  slowly-varying 
wireless  channels.  Such  a  model  supports  a  broad  range 
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of  scenarios  such  as  the  example  provided  in  Fig.  3,  illus¬ 
trating  a  “heterogeneous”  network  in  which  the  trans¬ 
mitter  sends  a  bit  stream  which  is  compressed  using  an 
HTVQ  with  a  CR  of  4:1.  At  Node  A,  the  bit  stream  is 
retransmitted  to  Nodes  B  and  C.  The  channel  between 
Nodes  A  and  B  is  a  lognormal  fading  channel,  modeled 
by  an  FSC,  operating  at  a  bit  rate  corresponding  to  a 
CR  of  8:1;  the  channel  between  Nodes  A  and  C  is  a 
memoryless  BSC  with  bit  error  rate  ei  =  0.1  operating 
at  a  bit  rate  corresponding  to  a  CR  of  4:1.  The  goal 
is  to  design  a  system  which  is  capable  of  transcoding 
the  bit  stream  by  table-lookups  (without  an  expensive 
decoding  followed  by  encoding)  at  Node  A  which  pro¬ 
vides  robustness  over  the  noisy  channels  of  links  AB  and 
AC.  Note  that,  in  general,  the  system  must  be  able  to 
handle  any  other  rate  reduction  or  channel  combination 
only  by  using  lookup  tables. 

Our  approach  is  to  use  COVQ’s  to  design  the  CM- 
HTVQ  encoding  and  decoding  tables;  the  encoding  ta¬ 
bles  are  then  used  at  the  origination  nodes  for  transcod¬ 
ing  and  the  decoding  tables  are  used  at  the  destination 
nodes  for  the  decoding.  To  explain  this  concept,  let  us 
concentrate  on  the  example  of  Fig.  3  where  we  need  a  2- 
stage  HTVQ  to  transmit  the  source  to  Node  A.  Since  the 
output  of  the  first  stage  is  directly  used  as  the  address 
for  the  second  stage,  the  imaginary  channel  between  the 
two  stages  is  noiseless.  Also,  the  channel  connected  to 
Node  A  is  a  noiseless  channel;  so,  a  regular  HTVQ  with 
a  CR  of  4:1  can  be  used  to  transmit  the  source  to  Node 
A.  At  Node  A,  we  need  to  generate  two  bit  streams,  one 
for  transmission  to  Node  B  and  another  one  for  Node 
C.  The  transcoding  is  performed  using  lookup  tables 
designed  as  follows. 

Let  us  assume  that  the  average  received  SNR  of  the 
fading  channel  in  link  AB  is  10  dB  and  we  use  an  FSC 
with  two  states  to  model  the  channel  (Channel  1  in  [7]). 
As  it  is  described  in  [7],  the  FSC  model  consists  of  a 
BSC  with  probability  of  error  equal  to  0.1  and  a  noise¬ 
less  channel.  State  probabilities  are  0.238  and  0.762, 
respectively.  Two  8-dimensional  COVQ’s  with  a  CR  of 
8:1  are  designed  for  the  two  constituent  BSC’s  of  the 
FSC  and  a  lookup  table  is  designed  for  each  BSC.  To 
transmit  the  sequence  over  link  AB,  at  each  time  in¬ 
stant,  the  appropriate  encoding  table  is  used  based  on 
the  CSI  (Fig.  2).  Likewise,  a  4-dimensional  COVQ  with 
a  CR  of  4:1  is  designed  for  a  memoryless  BSC  with 
probability  of  error  equal  to  ei  =  0.1.  Such  a  COVQ 
is  used  to  design  the  encoding  table  for  transcoding  the 
sequence  for  link  AC.  The  table  provides  a  new  encod¬ 
ing  map  that  is  matched  to  the  BSC.  Note  that  this 


table  is  not  used  to  compress  the  bit  stream  (because 
of  the  specific  rate  assumptions  in  the  example),  rather 
it  is  used  to  match  the  encoded  sequence  to  the  char¬ 
acteristics  of  the  communication  link.  So,  the  number 
of  output  bits  for  this  table  is  equal  to  the  number  of 
input  bits.  Such  a  table  is  not  restricted  to  HTVQ  and 
can  be  used  for  regular  VQ’s  as  well. 

The  decoding  table  associated  with  the  appropriately 
designed  COVQ  must  be  used  for  decoding  at  Nodes  B 
and  C.  Table  6  provides  the  received  peak  SNR  (PSNR) 
at  each  node  for  the  512  x  512  Lenna.  Also,  we  report 
the  results  for  the  case  that  the  tables  are  not  matched 
to  the  characteristics  of  the  channel.  Note  that  the  re¬ 
sult  at  Node  C  is  different  from  the  performance  of  a 
system  which  consists  of  a  2-stage  CM-HTVQ  (designed 
for  the  BSC  in  link  AC)  and  a  hypothetical  BSC  con¬ 
necting  the  source  to  Node  C  directly.  This  is  because 
in  the  the  example  of  Fig.  3,  we  need  two  reconstructed 
versions  of  the  source  at  Nodes  A  and  C.  Our  simula¬ 
tion  results  indicate  that  we  only  pay  0.14  dB  penalty 
in  the  performance  at  Node  C  to  generate  a  replica  of 
the  source  at  Node  A.  Also,  note  that  if  we  do  not  use 
the  “matching  table”  at  Node  A  and  transmit  the  re¬ 
ceived  bit  stream  at  Node  A  directly  to  Node  C,  the 
PSNR  result  at  Node  C  degrades  by  about  9.1  dB.  If 
we  use  regular  VQ’s  for  quantization,  we  can  still  uti¬ 
lize  a  256-byte  matching  table  at  Node  A  and  improve 
the  performance  by  about  9.19  dB  (Table  6).  The  result 
would  be  the  same  if  we  decoded  the  sequence  at  Node 
A  and  re-encoded  it  using  a  COVQ.  So,  the  matching 
table  provides  the  optimal  bit  stream  by  using  simple 
lookup  tables  instead  of  decoding  and  encoding  the  se¬ 
quence.  Again  our  simulations  indicate  that  if  we  wish 
to  use  a  regular  VQ,  creating  two  versions  of  the  source 
at  Nodes  A  and  C  degrades  the  result  by  only  about 
0.05  dB  compared  to  the  case  that  we  only  generate  the 
output  at  Node  C.  Similar  results  have  been  obtained 
for  other  channels  we  have  studied. 

4.CONCLUSIONS 

We  have  proposed  a  CM-HTVQ  which  outperforms 
HTVQ  when  the  channel  is  noisy.  Also,  we  have  used 
an  FSC  to  model  slow  fading  channels  and  proposed  an 
adaptive  coding  scheme  by  using  a  set  of  encoder  and  de¬ 
coder  tables.  The  performance  of  CM-HTVQ  is  slightly 
inferior  to  that  of  COVQ  (its  complexity  is  much  less); 
however,  the  performance  gap  between  CM-HTVQ  and 
COVQ  is  much  less  than  the  performance  gap  between 
HTVQ  and  VQ.  The  gap  shrinks  as  the  channel  becomes 
noisier.  CM-HTVQ  can  also  be  designed  for  struc- 
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tured  VQ’s  like  tree-structured  VQ,  multi-stage  VQ, 
and  finite-state  VQ.  Using  table-lookups  for  the  encoder 
of  different  types  of  VQ’s,  assuming  a  noiseless  channel, 
has  been  studied  in  [8].  To  design  a  channel-matched 
version,  the  VQ  which  is  used  for  designing  the  last 
stage  must  be  substituted  with  a  COVQ.  Future  work 
includes  the  transmission  over  a  wireless  channel  when 
the  CSI  is  only  available  at  the  decoder.  For  this  case, 
we  have  found  an  optimal  solution  for  COVQ’s.  The 
same  approach  has  been  applied  to  the  CM-HTVQ’s.  ^ 
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e 

VQ 

GOVQ 

HTVQ 

GM-HTVQ 

0.00 

2.33 

2.33 

1.18 

1.18 

0.01 

1.91 

2.02 

0.93 

1.04 

0.03 

1.25 

1.67 

0.49 

0.88 

0.05 

0.74 

1.43 

0.15 

0.77 

0.10 

-0.12 

1.00 

-0.45 

0.55 

Table  1:  Memoryless  Gaussian  Source  (SNR  in  dB); 
GR=16:1;  block  size=16. 


e 

VQ 

GOVQ 

HTVQ 

GM-HTVQ 

0.00 

4.85 

4.85 

4.05 

4.05 

0.01 

3.82 

4.21 

3.22 

3.57 

0.03 

2.42 

3.59 

2.02 

3.08 

0.05 

1.45 

3.14 

1.19 

2.72 

0.10 

0.00 

2.26 

-0.14 

1.98 

Table  2:  Memoryless  Gaussian  Source  (SNR  in  dB); 
GR=8:1;  block  size=8. 


e 

VQ 

GOVQ 

HTVQ 

GM-HTVQ 

0.00 

30.49 

30.49 

29.69 

29.69 

0.01 

24.05 

28.24 

23.79 

27.79 

0.03 

20.02 

26.40 

19.94 

26.35 

0.05 

17.99 

25.22 

18.01 

25.13 

0.10 

15.27 

23.02 

15.26 

22.99 

Table  3:  Lenna  (PSNR  in  dB);  GR=16:1;  block  size=4x 
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e 

VQ 

COVQ 

HTVQ 

CM-HTVQ 

0.00 

32.47 

32.47 

31.79 

31.79 

0.01 

24.13 

29.48 

23.97 

29.20 

0.03 

19.90 

27.46 

19.88 

27.32 

0.05 

17.89 

26.09 

17.83 

26.00 

0.10 

15.12 

23.40 

15.06 

23.35 

Table  4:  Lenna  (PSNR  in  dB);  CR=8:1;  block  size=4  x 

2. 


e 

VQ 

COVQ 

HTVQ 

CM-HTVQ 

0.00 

31.09 

31.09 

29.48 

29.48 

0.01 

25.92 

29.42 

25.15 

28.79 

0.03 

21.72 

28.32 

21.49 

27.72 

0.05 

19.49 

27.06 

19.36 

26.58 

0.10 

16.09 

24.18 

16.08 

23.98 

CSI 


Figure  2:  CM-HTVQ  for  FSC’s. 


Table  5:  Salesman  sequence  (average  PSNR  in  dB  for 
100  frames);  r  =  0.6  bpp. 


Figure  1:  HTVQ  Encoder. 


Node 

VQ 

COVQ 

HTVQ 

CM-HTVQ 

A 

35.80 

35.80 

35.18 

35.18 

B 

21.11 

28.18 

21.01 

27.95 

C 

15.04 

24.23 

15.04 

24.14 

Figure  3:  Example. 


Table  6:  Lenna  (PSNR  in  dB). 
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